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SECTION  I 


INTRODUCTION 

This  project  is  a continuation  of  the  research  btigun  under 
Contract  F33615-73-C-4007  to  investigate  areas  ot  self- 
organizing systems  theory  that  are  relevant  to  the  Air  Forc<^. 

The  previous  study  emphasized  techniques  for  terminal-value 
control  of  vehicles  and  for  conti'ol  systems  thai  must  avoid  oper- 
ating regions  with  high  performance  penalties  (11).  Both  cases 
required  the  development  and  incorporation  of  long-*;erm  memory 
in  the  parameter  search  algorithm.  This  memory  was  efficiently 
encoded  in  the  form  of  multimodal  probability  density  functions 
(pdf  s)  . 

The  objective  of  the  present  project  was  to  develop  these 
techniques  further  and  to  extend  them  to  higher-dimensional 
problems.  Four  major  areas  of  interest  were  investigated.  First, 
methods  were  developed  and  tested  for  increasing  convergence 
rates  of  searci  s that  must  avoid  high-penalty  regions.  Second, 
techniques  for  controlling  probability  density  function-guided 
searches  wore  refined.  Third,  use  of  clustering  analyses  to  aid 
in  determining  the  complexity  of  an  optimization  problem  was 
studied  further.  Fourth,  the  strategies  resulting  from  the  above 
investigations  were  applied  to  an  image-processing  problem  with 
potential  application  in  remotely-piloted  vehicle  (RPV)  ground 
taryet  ? cqu  i.s  i t ion  . 

It  was  found  that  probability  density  function-guided  searches 
can  provide  good  in  form.- tion  with  which  to  initiate  guided, 
accelerated  random  searches.  The  increased  knowledge  about  The 
characteristics  of  the  optimization  problem  that  the  pdf-guided 
search  provides  can  be  used  to  help  other  searches  avoid  regions 
of  high  resource  consumption  or  disastrously  low  performance. 
Additionally,  the  pdf-guided  search  supplies  a list  of  locations 
with  high  associated  performance  that  can  be  used  as  starting 
points  for  subsequent  searching. 
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Clusterimj  analysed  proved  to  be  useful  tools  in  locatinR  and 
describing  both  local  and  global  extrema,  thus  enabling  the 
investigator  to  judge  the  complexity  of  the  surface  to  be  studied. 

In  addition,  they  indicaate  shilts  in  extrema  due  to  the  inter- 
action of  the  independent  variables. 

One  of  the  major  tasks  of  the  RPV  man-machine  interface  is  the 
encoding,  transmission,  reconsti uct ion , and  interpretation  of 
pictorial  information.  This  is  usually  accomplished  by  fast 
Fourier  encoding/decoding  techniques.  One  main  purpose  of  the 
remote  pilot  is  to  spot  those  regions  of  a picture  that  contain 
"interesting"  information;  e.g.,  a truck  moving  in  a background 
of  clutter.  A disadvantage  of  the  fast  Fourier  method  is  that 
it  is  insensitive  to  "interesting"  regions;  it  treats  all  data 
equally.  A novel  application  of  the  parameter  search  procedures 
studied  in  this  project  was  made  to  this  image  interpi-etat  ion 
problem.  While  the  data  compression  and  reconstruction  proper- 
ties of  this  new  approach  compare  favorably  to  the  fast  Fourier 
method,  the  key  result  is  that  interesting  regions  of  a picture 
are  automat ical ly  identified  by  the  new  encoding  process  and 
conveyed  directly  to  the  remote  pilot.  Therefore,  this 
approach  showed  good  promise  as  a new  technique  for  image  encoding/ 
decoding/ interpretat ion . Although  these  procedures  require  further 
development,  they  indicate  solid  potential  for  equalling  or 
bettering  techniques  presently  in  use. 


SKCTION  II 

DESCRIPTION  OF  Nl.-MEIU CAL  OPTIMIZATION  TEST  .SFRFACT; 


The  first  three  (of  four)  work  tusks  in  this  ;irojec1  utilized 
the  same  performarieo  function  to  test  the  parameter  search 
algorithms.  The  function  consists  of  a weighted  sum  of  five 
Gaussianly-shaped  modes,  with  centers  as  listed  in  Table  1. 

The  location  of  each  mode  center  remains  constant  for  all  v'aluos 
of  NDIM;  e.g.,  the  first  coordinate  location  of  Mode  2 is;  always 
at  0.40.  However,  both  the  size  factors  of  each  mode,  listed 
in  Table  2,  and  the  amplitude  factors,  listed  in  Table  3,  are 
altered  as  the  dimensionality  cf  the  parameter  space  (NDIM)  is 
increased.  It  was  nece.ssary  to  broaden  the  five  performance 
modes  as  NDIM  and,  consequently,  the  volume  of  the  performance 
space  was  increased.  This  insured  that  all  portions  of  the 
space  would  be  influenced  by  at  least  one  of  the  modes.  The 
absolute  amplitudes  of  the  three  smallest  performance  modes 
were  increased  for  values  of  NDIM  >_  10.  In  this  way,  most  of 
the  parameter  space  had  a reasonably  large  (performance)  func- 
tional value.  As  a result  of  tnesc  alterat  it)ns  in  .size  and  in 
amplitude,  the  location  and  function  value  of  the  global  maxi- 
mum shifts  a:  NDIM  changes,  as  shown  in  Table  4. 


The  performance  test  function  value,  f(X),  for  a point  in  the 
NDIM- dimensional  hyperspacc,  X = j • 


ff X)  = f(x.  , . . . , 


5 

^.N1)IM>  = 

m=l 


"t  H 

where.  ihe  Gaussian  mode  and  it  is  equal  to: 


g^(X)  = 


( 2i; ) 


NDIM/2  /'NDIM  2 


nil 


exp 


NDIM  /X  - u . 
' i mi 


i-1 


o 

mi 


where,  u • . o . and  w are  given  in  Tables  1 , 2,  and  3. 
mi  mi  m 

respectively . 
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TAB1.1-;  1 


CENTERS  OF  MODES  FOR  PERFORMANCE  TEST  FUNCTION 


Mode 


Dimension 

1 

2 

3 

4 

5 

1 

. 80 

.40 

-.69 

-.83 

-.67 

n 

z. 

.39 

-.82 

- .98 

.53 

-.05 

3 

-.68 

.59 

.61 

. 03 

-.18 

4 

.94 

.49 

-.72 

.10 

.75 

5 

-.20 

-.62 

.69 

.79 

-.86 

6 

. 14 

.56 

. 12 

.74 

.25 

n 

1 

-.58 

-.29 

.41 

.64 

- .77 

8 

.30 

.29 

.81 

-.04 

-.39 

r\ 

u 

o r\ 

. O 

O A 

. V_»  1 

HQ 

. GS 

. 54 

10 

.97 

.18 

-.03 

.08 

-.12 

11 

-.32 

.42 

-.66 

-.01 

.50 

12 

. 19 

.37 

.98 

.21 

-.53 

13 

- . Iw 

-.09 

-.78 

1 . 00 

. 55 

14 

. 96 

-.50 

.99 

-.75 

. 32 

15 

-.54 

- . 52 

-.45 

- . 59 

-.92 

16 

-.63 

.67 

. 46 

-.07 

-.73 

17 

-.26 

-.99 

. 77 

.92 

.32 

18 

-.14 

-.19 

-.74 

-.47 

-.94 

19 

-.76 

-.27 

. 45 

-.43 

.17 

20 

. 20 

-.96 

.62 

-.81 

.63 

TABLE  2 

SIZE  FACTORS  OF  MODES  FOR  PERFORMANCE  TEST  FUNCTION 


Mode 


Dinionsion 

I— 

2 

4 

2 

1 

.350 

.400 

.250 

.700 

.310 

?. 

.45.5 

.175 

.425 

.200 

.250 

1 

.600 

.650 

.750 

.700 

. 550 

2 

.705 

.600 

.675 

.650 

.750 

3 

.760 

.650 

.680 

.720 

.710 

4 

.700 

.600 

.685 

.640 

.610 

5 

690 

.800 

. 710 

.685 

. 770 

i. 

.900 

.850 

.850 

.900 

.850 

2 

1.0C5 

.800 

. 775 

.850 

1.030 

3 

1.060 

.850 

.780 

.920 

1 . 010 

4 

1.000 

. 800 

.785 

.840 

.910 

5 

.890 

1 .000 

.810 

.885 

1.070 

6 

.900 

.955 

.775 

.795 

.850 

7 

.870 

. 700 

.800 

.750 

.860 

8 

.900 

.830 

.783 

.830 

1.015 

9 

.940 

.940 

.650 

.935 

.500 

10 

.855 

.800 

.785 

.920 

.960 

1 

1.  V57 

2.00S 

1.235 

3.514 

1.536 

2 

2.284 

.879 

2.134 

1.004 

1.280 

3 

1.280 

2.008 

1.079 

1.180 

1.178 

4 

1.330 

1.757 

.728 

.979 

.947 

S 

1.104 

2.761 

1.155 

1.079 

2 . 662 

6 

1.757 

1.832 

. 176 

2.108 

1.434 

V 

2. 2'jy 

1 . 054 

1 . 50G 

.628 

1.894 

8 

1.004 

1.355 

2.259 

2.761 

2.1,6 

9 

1.4C6 

2.530 

1.481 

1.355 

2 . 330 

10 

1.079 

1.305 

2.761 

2.033 

4 .178 

11 

.929 

2.2!  9 

1 . 807 

2.008 

1.178 

12 

1.456 

1.029 

1.506 

2.259 

1 . 610 

13 

1.205 

2.284 

.628 

1.180 

1.792 

14 

2.385 

3.514 

2.284 

1.205 

1 . 229 

15 

2.761 

3.514 

X . 506 

.954 

1.280 

J 

2.815 

3.217 

2.010 

5.629 

2.432 

2 

3.659 

1.407 

3.418 

1.608 

2. 030 

3 

2.051 

3.217 

1,729 

1.890 

1.870 

4 

2.131 

2.775 

1.166 

1.568 

. 151 

5 

1.769 

4.423 

1 . 850 

1.729 

4.202 

6 

2.815 

2.935 

2.815 

3.337 

2.272 

7 

3.538 

1.683 

2.412 

1.005 

2 . 995 

8 

i.608 

2.171 

3.613 

4.423 

3.438 

9 

2.252 

4 . 02. 1 

2.372 

2.171 

5.679 

10 

1.729 

2.091 

4 1 ' 

3.257 

1,870 

11 

1.488 

3.613 

3.217 

1.870 

12 

2.332 

1.649 

3.619 

2.392 

13 

1.930 

3.650 

1.890 

2.835 

14 

3.820 

5.629 

1.930 

1.959 

15 

4.123 

5.629 

i . 

1.52,8 

2 030 

IG 

1.649 

1.769 

. ' "i  ' 

2.131 

1 . 870 

17 

2.734 

2.815 

2.031 

1.809 

2.392 

18 

] .287 

1.769 

2.412 

1.608 

3.558 

19 

1.608 

1.890 

5.629 

1.809 

3.438 

20 

1 . 005 

2.. 332 

1.045 

2.412 

3.247 

TABLE  3 


AMPLITUDE 

FACTORS 

OF  MODES 

FOR  PERFORMANCE  TEST 
Mode 

FUNCTION 

Dimension 

1 

2 

3 

4 

5 

2 

M j 
o 

O 1 

-0.50 

0.10 

0.50 

1 . 00 

j 

-1 .00 

-0.50 

0.10 

0.50 

1.00 

10 

-i.OO 

-0.75 

0.50 

0.75 

1.00 

15 

-1.00 

-0.75 

0.55 

0.75 

1.00 

20 

-1.00 

-0.75 

0.55 

0.75 

1.00 

I 


TABLE  4 


LOCATIONS  AND  FUNCTION  VALUES  OF  GLOBAL  MAXIMUM 


NDIM 

Dimension 

_2 

5 

10 

15 

20 

1 

-.671 

-.710 

-.734 

-.848 

-.976 

2 

-.046 

-.025 

-.014 

.463 

.486 

3 

-.187 

-.237 

-.063 

-.056 

4 

. 746 

. 755 

.390 

.745 

5 

-.855 

-.859 

.65  3 

.660 

6 

.246 

.423 

.377 

7 

-.788 

. 561 

.535 

8 

-.442 

-.537 

-1.000 

9 

. 543 

.705 

. 835 

10 

-.154 

-.145 

-.312 

11 

.420 

.595 

12 

-.446 

-.734 

13 

.939 

-.879 

14 

1 

CO 

CO 

-.181 

15 

-.693 

-.734 

16 

-.640 

17 

.817 

18 

-.819 

19 

-.142 

20 

-.481 

Function 
Maximum : 

1.0164 

. 9696 

.9459 

.9660 

1.0259 
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SECTION  III 

SELF-ORGANIZING  SEARCH  ALC>ORITHMS 


It  was  shown  in  the  previous  work  (il,  12,  13)  that  the  results 
of  a pdf-guided  search  could  supply  a good  location  from  which 
to  begin  a guided  random  search  — at  least  for  two-dimensional 
spaces.  One  of  the  work  tasks  in  this  project  was  to  investigate 
this  concept  for  higher  dimensional  spaces.  By  way  of  intro- 
duction, the  next  two  subsections  are  excerpted  from  (11). 


3.1  Sex f-Organizing  Long-Term  Memory  Search  (PDF) 

Because  of  physiological,  structural,  thermal,  or  other  con- 
straints, many  systems  must  avoid  operating  regions  character- 
ized by  very  high  performance  penalties.  Those  cases  in  which 
the  l egions  to  be  avoided  can  be  eACJuded  by  placing  a p i i u i’  1 
bounds  on  the  search  present  little  difficulty;  therefore,  the  other 
cases,  in  which  these  regions  can  only  be  determined  after  the 
fact,  .ere  investigated  in  this  study. 


The  two  types  of  high-penalty  search  problems  are: 

1.  Those  in  which  a particular  choice  of  a set  of  para- 
meters leads  to  poor  system  performance  (as  measured 
by  a performance  assessment  function)  witn  an  accom- 
panying large  consumption  of  system  resources  (e.  g.  , 
operating  an  aircraft  engine  well  below  its  maximum 
thermodynamic  efficiency  decreases  its  work  output 
and  Increases  the  fuel  consumption). 

2.  Those  in  which  a particular  choice  of  a set  of  para- 
meter leads  to  a disastrous  outcome  (e.  g.  , increasing 
the  pressure  in  a boiler  beyond  an  upper  safety  limit). 


Since  the  resources  available  to  conduct  the  parameter  search 
problem,  are  limited  (such  a.s  the  amount  of  aircraft  fuel  or  the 
maximum  number  of  triads  in  a computer-based  optimization),  this 
factor  must  necessarily  play  an  important  role  in  the  logic  of 
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tile  search  procedure.  Accordingly,  the  pdf-guided  search 
algorithm  is  explicity  guided  in  its  internal  strategy  as  a 
function  of  the  remaining  system  resources. 

This  new  search  techtiique  additionally  employs  the  information 
gained  by  previous  trials  (iterations)  in  a novel  way  so  as 
to  increase  the  probability  that  future  trials  will  yield  better 
performance  scores  than  past  trials.  The  information  gained 
in  previous  trials  is  encoded  in  a multivariate  probability 
distribution  function,  p(X|k).  which  denotes  the  probability 

that  the  i rial  parameter  vector  X = (x^ Xj,^)  will  yield  a 

performance,  P(X),  which  falls  within  the  bin  in  the  perfor- 

mance range  (k=  1,  ...,  K>,  whore  k = 1 aenotes  the  range  of 
best  performance  scores.  Trial  vectors.  X,  are  selected  on  the 
basis  of  yielding  good  performance.  Alternately,  trial  vectors 
could  be  so  selected  that  the  probability  is  low  that  they  will 
not  yield  poor  performances.  A trial  vector  yielding  a poor 
performance  denotes  a wasted  experiment.  Each  wasted  experiment 
DC  costly.  Therefore,  iiigu  penalties  are  assigned  tc  regions  in 
the  parauneter  space  in  which  trial  X vectors  are  obtained  with 
correspondingly  poor  performance  scores. 

After  each  trial  vector  X is  employed,  the  performance  score, 

P(X) , is  noted,  and  the  region  of  parameter  space  containing 
X has  a probability  assigned  to  it  based  on  the  value  of  P(X). 

If  P(X)  falls  within  the  bin,  p(X|k)  is  updated.  In  this 

manner,  all  the  information  that  has  been  generated  since  the 
beginning  of  the  search  is  encoded  in  long-term  memory  PDF's. 

These,  in  turn,  bias  the  search  away  from  probable  low  yield 
parameter  regions  and  towards  probable  high  yield  regions. 

The  terminology  used  in  rofeiring  to  the  three  parts  of  the  PDF 
search  in  this  report  is  as  follows;  PDFl  --  unbiased  random 
sampling  (of  XTOT  points)  followed  by  division  of  the  KTOT  points 
into  K performance  classes  depending  upon  the  associated  performance 
value,  PDF2  --  K separate  cluster  analysu.s  in  the  N'-dimensional 
space  to  obtain  one  multimodal  pdf  for  each  of  the  K classes;  PDFS  -- 
adaptive  search  phase  in  which  the  p(X  k)  are  updated  as  outlined 
above  and  described  in  detail  in  (11,  12,  13). 
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3.2  Guide d Accelerated  Random  Search  (GARS) 

The  guided  acc(;lerated  random  search  (GARS)  algorithms  are  probability 
state- vari able  (psv')  searches  that  arc  particularly  intended  for 
applications  involving  multimodal  performance  surfaces.  These 
algorithms  are  suitable  for  spaces  of  low  or  high  diniens ior  al i ty 
and  for  the  search  of  stationary  or  time-varying  surfaces.  The 
more  flexible  of  the  GARS  algorithms  contain  the  following  provisions: 

(a)  Uniform  Random  Phase  — A search  phase  in  which  a uniform 
pdf  is  employed  to  govern  the  generation  of  random 
trials.  The  heuristic  principals  of  reversal,  acceler- 
ation, and  deceleration  arc  incorporated  for  the 
exploitation  of  the  results  of  random  exper iment a 1 3 on . 

(b)  Biased  Search  Phase  --  A search  pha.se  in  which  the  pdf 
is  adjusted  in  accordance  with  the  result.sof  ongoing 
trials  so  as  to  increase  the  rate  of  thsir  convergence. 

This  is  the  most  important  of  the  GARS  phases  and 
utilizes  three  basic  methods  of  governance  of  random 
experiments.  The  principals  of  reversal,  accelerations 
and  decelerations  arc  again  employed,  as  in  the  uniform 
random  phase,  subsequent  to  each  random  experim.ent 
producing  ati  improved  or  worsened  score. 

(c)  Biiised  Random  Phase  with  Activity  Factor  --  A pliase  in 
which  the  fraction  of  the  total  number  of  free  vai-iables 
subject  to  manipulation  is  progressively  reduced  from 
unity  until  approximately  two  variables  only  (or.  the 
average)  are  being  adjusted.  An  activity  factor, 
which  is  a function  of  trial  number  and/or  performance, 
determines  the  a priori  probability  that  a specific 
parameter  will  be  varied  in  any  given  random  trial. 

The  identity'  of  parameters  manipulated  is  kept  random 
while  the  activity  factor  is  systematically  leduc.ed. 

This  phase  is  the  same  as  (b)  except  for  the  use  of 
an  activity  factor . 

(d)  Systematic  Phase  --  A phase  in  which  a steepcst-descent 
(or  ascent)  principal  is  used  (with  .small  acLivity 
factor  in  the  case  of  high-dimensiona i problems)  for  the 
"fine  tuning"  of  performance  in  tb.e  vicinity  of  best 
results  from  a preceding  random  phase. 

(e)  Systematic  Phase  With  Single-Dimension  Manipulation  -- 
Subsequent  to  the  steepest-descent  adju.stment,  a systematic 
one  dimension-at-a-time  adjustment  is  made  of  the  manipulable 
parameters . 


1 1 


In  addition  to  the  concepts  of  reversal,  acceleration,  and  decelera- 
tion mentioned  above  and  described  in  the  general  literature 
(2,  3,  9).  several  additional  principals  are  embodied 
in  the  general  GARS-type  algorithms.  These  relate  to  the 
observance  of  boundaries  on  the  admissible  values  of  parameters, 
pioceduros  for  periodic  reinitialization  of  GARS  so  that  the 
record  oi  best-to-date-performance  dees  not  become  misleading 
if  the  performance  surface  is  varying  with  time,  and  use  of  certain 
methods  for  controlling  the  directions  and  lengths  of  random  steps. 

AM  of  the  probability  state  variable  techniques,  including  GARS, 
offer  rapid  convergence  and  have  the  ability  to  deal  with  the 
problem  of  time-varying  surfaces  and  high  levels  of  noise  in 
measurements . 

The  method  or  methods  used  for  storage  of  performance  data  are 
of  great  importance  in  advanced  versions  of  self-organizing  and 
learning  systems  because  tiie  efficiency  of  such  systems  is 
highly  dependent  on  the  memory  processes  used.  This  is 
particularly  true  for  multivariate  systems;  if  one  uses  conven- 
tional methods,  the  retention  and  accession  of  information 
become  increasingly  difficult  as  t’.ie  number  of  variables  increases. 
Accordingly,  those  procedures  suitable  for  encoding  long-term 
memory  for  self-organizing  and  learning  systems  were  investigated. 
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3.3  (!^oniposlte  Sec.rch  T».^chnique  Combininp;  PDF  and  GARS  AH^urithms 


Each  of  the  techniques  described  above  has  certain  advantages 
and  disadvantages.  GARS  is  fast  and  accurate,  but  its  conver- 
gence; rate  is  somewhat  dependent  on  the  choice  of  a starting  point. 
GARS  can  sometimes  spend  considerable  time  extracting  itself  from 
local  maxima.  PDF  can  more  accurately  learn  the  general  topo- 
graphy of  the  performance  surface  and.  consequently,  it  can  distin- 
guish between  local  and  global  maxima  quite  well.  Its  main  disad- 
vantages are  that  it  is  slow  and  it  is  unlikely  to  discover  the 
centroid  of  the  global  maximumi  without  high  expenditure  of  resources 
(i.e.,  large  amounts  of  searching  in  the  immediate  vicinity  of  the 
global  maximum). 

A more  general  and  powerful  approach  would  be  to  combine  PDF  and 
GARS.  The  multimodal  statistical  capabilities  of  PDF  would  be 
substituted  for  the  unimoaal  statistical  stra..egy  of  GARS' 
random  phases.  Such  a method  would  not  only  retain  the  '’irtues 
of  each  algorithm,  but  would  yield  other  benefits  not  possessed 
by  either  alone. 

To  begin  with,  PDF  provides  GAl  S with  a good  starting  point  This 
can  be  either  the  best  single  point  found  by  PDF  or  the  cluster 
center  of  the  mode  possessing  the  highest  performance  value.  If 
there  is  more  than  one  extremal  point  of  interest,  the  PDF  results 
can  be  employed  to  locate  the  regions  in  which  they  occur,  and  to 
Initialize  a search  in  each  one.  The  knowledge  of  the  topography 
of  the  performance  surface  acquired  by  PDF  can  also  be  used  to 
more  effectively  prevent  GARS  from  falling  into  regions  with  high 
resource  penalties  or  low  performance  score. 

Perhaps  the  most  important  benefit  of  combining  PDF  with  GARS  is 
the  i.acreased  sophistication  and  efficiency  that  PDF  can  give  to 
the  second  and  third  phases  of  GARS  (the  statistically  biased 
search  phases).  As  GARS  is  presently  formulated,  a unimodal  pdf 
is  generated,  centered  at  the  current  best-to-date  point,  and  this 
distribution  is  used  to  choose  tiie  next  trial  point.  This  method 
js  a considerable  improvement  over  simple  random  sampling,  but  it 
mainly  confines  the  search  to  a (Gaussianly-shaped)  neighborhood 
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ol  the  current  best -to-date  point.  Limiting  the  search  in  this 
way  both  slows  convergence  and  may  result  in  the  search  occasion- 
ally and  unnecessarily  becon:ing  stranded  on  a local  maximum  for 
a long  period.  Substituting  the  multimodal  distributior  function 
adaptively  developed  by  I'DF  will  provide  GARS  with  much  more 
information  about  the  performance  surface.  This  will  enable 
GARS  to  make  bettor  choices  of  new  trial  points.  The  results  of 
each  trial  can  be  used  adaptively  to  update  the  pdt  model,  lead- 
ing to  moie  efficient  .searching. 

3.4  E.xper imental  Procedure 

The  procedure  followed  in  this  investigation  was  a modest  first 
effort  at  approximating  the  composite  search  algorithm  described 
above.  It  does,  however,  demonstrate  that  combining  the  two 
techniques  is  both  feasible  and  desirable. 

The  experimental  procedure  was  as  follows.  The  test  surface 
was  selected  to  be  the  performance  function  described  in 
Section  2.  It  was  first  searched  (for  the  global  maximum)  using 
the  PDF  algorithm.  This  consists  of  an  initial  random  sampling 
followed  by  a clustering  analysis,  then  a pdf-guided  search, 
until  the  system's  resources  were  exhausted  as  described  above 
in  Section  3.1. 

GARS  was  initialized  in  three  different  ways.  First,  the  random 
phase  was  deleted  and  the  first  biased  phase  started  from  the 
best  point  found  by  PDF.  Second,  the  random  phase  was  again 
deleted  and  the  biased  phase  started  from  the  center  of  one  of  the 
clusters  formed  by  PDF  from  the  points  in  the  top  performance 
class.  These  first  two  schemes  for  selecting  a starting  point  for 
GARS,  in  effect,  substituted  PDF  for  the  random  search  phase  of 
GARS.  The  third  technique  was  to  use  a cluster  center  from  the 
lowest  performance  class  as  the  starting  point  for  GARS.  This 
ensured  that  the  starting  point  would  be  far  enough  from  the 
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global  max iin jii!  t«.!  allovi,  lot  a valid  comparison  to  "bottor"  start- 
ing points.  In  this  thii'd  case,  the  inilia.1  random  search  phase 
was  retaint.-d. 


Table  5 summarizes  the  -12  e.xperimental  searches  that  wore  made 
on  the  live  performance  surfaces.  NDIM  is  the  dimensionality  of 
the  surface;  KTOT  is  the  number  of  .samples  in  the  random  phase  of 
IMiF  (note  that  for  NDIM  = 2,  f> , and  tO,  there  is  more  than  one 
value  of  KTOT).  Changing  the  number  of  random  samples  affects  the 
structure  of  the  cluster  model  and  the  percentage  of  the  initial 
resources  that  is  cemsumed.  The  strategy  for  the  pdf -guided 
search  is  tlierefore  altered  as  well.  The  GARS  starting  point  is 
identified  in  one  of  five  ways; 

X*  - I’DF  best  point 

— • Center  of  cluster  in  top  performance  class 

(if  there  are  more  than  one  such  cluster, 
they  are  numbered  consecutively,  «5tc.) 

Xg  - Center  of  cluster  in  lowest  pe'*formance  cla;  3 

x^^  - Origin  of  space  (2  and  20  dimensions  only) 


x^  A random  point  (2  dimensions  only) 


To  ensure  that  Xj^  was  not  too  far  from  thf»  global  maximum  to  be 
validly  cfxnpared  to  x*  and  .x , and  x^  wt;re  used  as  checks. 

The  values  of  all  the  starting  points  are  listed  in  Tables  6 
through  10  along  with  tlie  final  (best)  point  and,  for  each  value 
of  NDIM,  the  location  of  the  global  maximum. 


3.5  Illustrative  Example 


The  results  of  the  42  searcncs  are  given  in  Tables;  A-\  thmugli 
A-11  of  Append  LX  A.  To  clarify  the  experimental  procedure  and  tc 
aid  in  interpreting  results.  Run  1 of  Table  A-1  can  serve  as  an 
ill  ust  rat  i(jn  . 
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TAULK  5 


rOHTV-TWO 

SEAHClli;S  MADE 

WITH  COMPOSITE 

ALGOR  I Til 

GARS 

Stan  ill 

Run  Nuinbcr 

NDIM 

rroT 

Point 

1 

2 

50 

* 

X 

2 

2 

50 

X 

3 

2 

100 

X 

4 

2 

100 

X 

5 

2 

200 

X 

6 

2 

200 

’‘l 

7 

2 

200 

’‘2 

8 

2 

200 

9 

2 

200 

’‘4 

10 

2 

- 

=<0 

11 

2 

- 

X 

c 

12 

2 

200 

13 

5 

100 

X* 

1-1 

5 

100 

^1 

IS 

5 

100 

^2 

16 

5 

200 

X* 

17 

5 

200 

18 

5 

200 

^2 

19 

5 

200 

'^*3 

20 

5 

200 

21 

5 

200 

^5 

22 

3 

200 

’‘6 

23 

5 

200 

24 

5 

200 

25 

5 

200 

’‘b 

26 

10 

100 

X* 

27 

10 

100 

X 

28 

10 

200 

X* 

29 

10 

200 

^1 

30 

10 

200 

^2 

31 

10 

200 

’‘b 

32 

15 

200 

X* 

33 

15 

200 

*1 

34 

15 

200 

*2 

35 

15 

200 

!l3 

36 

15 

200 

37 

20 

O 

o 

X* 

38 

20 

200 

X. 

1 

39 

20 

200 

*2 

40 

20 

200 

'‘3 

41 

20 

200 

42 

20 

200 

16 
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TABU-;  6 


BTAIITI  J AND  I'INAL  I’OINTS  1-OR  SRAUClilCS 
OF  TWO-DIMKNSIONAF  TEST  FUNCTION 


Run  Number 

St  art  iHt; 

Point 

F 1 n a 1 

Po  i n t 

1 

- . 663 

- .054 

- .670 

- . 045 

2 

-.635 

-.069 

- .675 

- . 046 

2 

-.666 

- . 0 IB 

- .667 

- . 04  9 

4 

-.640 

. 138 

-.671 

- . 04  8 

5 

- . 690 

- . 050 

- .671 

-.04  5 

G 

-.682 

. 391 

-.671 

- . 0^18 

7 

-.507 

. 04  4 

-.675 

- . 04  7 

8 

-.886 

. 165 

-.671 

- . 046 

9 

- . 708 

- . 232 

- . 670 

-.046 

10 

.000 

.000 

- .670 

-.051 

1 1 

.570 

-.990 

- .672 

- . ('4  8 

12 

.752 

.698 

- .672 

-.048 

True  Maximum:  -0.671,  -0.04G 
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STARTIMG  AND  FINAI,  *iOINTS  KOR  SEARCHES  OF  F IVK-DIMENSIONAi.,  TEST  FUNCTION 
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True  Maxinutn  -0.976,  O.^fo.  -0  006,  O.V'l.O,  0.6GO,  0.377 

0.;j3&,  -1. 000,  0.83r>,  -0.:>]2,  0,595,  -0.73-1 
-0.R79,  -O.lf.l,  -0.73^,  -0.6-10.  0,  .R17,  -o.RJQ 


Run  1 w,'!;;  a combined  search  en  the  t wo-dimei.sic'nal  test  surface. 

The  locatif.'ns,  .sizes,  and  junplitudes  of  the  perlUrniance  function 
V:eie  fxiten  above  in  Tables  1 throu(;h  d,  ard  the  location  and  functii 
value  of  the  (global  maxitnurii  were  niven  in  "I  able  4. 

The  system  initially  possessed  40G  units  of  resour ce.s,  or  200 
units  per  dimension.  Fifty  random  trials  were  made,  eiiuallj.ng 
25  per'  dimension.  The  system  resource.s  aie  depleted  on  each  trial 
by  the  difference  betwee-n  unity  and  the  .'la.ximun  function  value, 
whichever  is  larger,  and  the  function  valui;  of  the  trial  point. 

Since  the  m’aimum  function  value  is  appro.'.ir.iatel  y -•!,  the  average 
function  value  should  be  approx! Tiaiely  0,  and  tne  penalty  pt?r 
random  trial  l.s  approximately  1.  This  is  coniirmed  by  the  resource 
coucrumpl  ion  in  PDF!  . the  unbiasei  random  search--52  60  units  for  50 
trials--wh.i  :h  constitutes  a loss  of  13.15  percent  of  the  total  syste 
resources.  The  best  point  found  by  the  random  seaich  in  Run  1 had  a 
function  value  oi  0.9897,  cr  97.4  percent  of  the  maxitmiin  value  of 
1.0164. 

In  order  to  facilitate  comparisons  between  experiments,  the  di-- 
tance  from,  the-  best  point  to  the  frlobal  maximum  has  been  normal- 
ized by  the  maximum  diameter  of  the  hypercubical  space.  For  >ach 
of  the  five  test  surfaces  the  soacc  is  a hypercube  with  e-nch 
dimension  taking  on  values  fro.m  -1  to  -t?  . Therefore,  the  maximum 
instance  between  he  two  p^oinrs  is  2 x (NDIM)‘,  wh(.-re  NbIM  is 
the  dimonsiono  ■'  i ty  of  the  .cs'  surface.  Thus,  for  Hun  1.  the 
n-ormalized  dist.ance  fiom  the;  best  to  the  mani.nium  vs  0.02. 

PDF 2 , the  second  section  of  PDF,  is  a clustering  analysis.  In  this 
case,  the  50  sair.itlc  points  fall  into  12  clu.stv  its. 

”DF3  is  ttie  guided  search  phase.  In  this  case,  the  system  resourtes 
were  not  expended  until  500  trials  (250  trials  pel  dimension)  had 
o-ccurred.  This  is  a .strong  indication  that  th*:-  searcl.  is  focusing 
increasingly  en  nie  higher  performance  class  as  il  sinjnld,  a.s 
explained  above  in  Section  3.2.  The  500  trials  con.sumed  oiily 


347.40  resource  units,  or  0.70  units  per  trial.  This  is  .substan- 
tially lower  ttian  the  average  of  i . O.o  unit.s  per  trial  in  POl'l  , 
indicating  that  the  average  per forinance  is  considerably  higher. 

Tile  resource  consur.ied  in  PDF3  was  S6.85  percent  of  the  initial 
system  resource,  that  is,  all  that  remained  upon  terminating  PDFl . 

The  best  point  found  by  PDF3  has  a function  value  of  1.0155,  which 
is  99.9  percent  of  the  maximum  value.  The  point  it.self  is  a negli- 
gible distance  from  tlie  global  maximum.  Tl.e  difference  in  function 
value.s  between  the  PDF3  best  point  and  the  PDFl  best  point  is 
0.0258. 

The  GARS  search  in  Run  1 is  begun  with  the  best  point  froiri  the  PDF 
search  a.s  shown  in  Tables  6 through  10.  Part  1,  the  random  search, 
is  deleted  since  PDF  has  already  fulfilled  the  purpose.  Parts  2 and 
3 of  GARS,  the  b'ased  search  j, bases,  are  allowed  to  run  for  a maxi- 
mum of  50  it?"ations  each.  Neither  part  can  improve  on  the  starting 
poitit.  (At  t!ie  current  stage  in  the  development  of  GARS,  Parts  2 
and  3 employ  a unimodal  pdf,  cfintorod  at  the  current  bf  st  point, 
to  choo-se  thf'  next  trial  point.  The  multimodal  distribution  gene- 
rated by  the  PDF  al gor thm  iias  not  yet  been  incorporateo  as  dis- 
cu.ssed  in  Section  3.3.)  'Ihe  gradient  search  phase.  Part  4,  is  also 
allowed  50  iteration.s;  it  succeeds  once  in  finding  a better  point. 
The  function  value  oi  the  i-oint  is  1.0158  or  99.9  pci-cent  of  the 
functicin  maximum.  The  normalized  disianoe  from  the  r.  ,nv  best  point 
to  the  global  maximum  is  0.93.  (Note  that  ii  is  possible  for  a 
point  farther  f ro:n  the  global  mc-Ximum  to  have  a higher  func.tion 
vali.c'  than  another,  closer  j^oint.)  'I'h  r fine-tuning  phase  of  GARS, 
Part  5,  j .s  allowed  only  20  iterations;  it  improves  on  the  be.st 
poi  t once?.  The  new  be.st  point  ha.s  a function  value  of  1.0164, 
the  maximum  function  value,  and  i.s  at  a near-zero  normal  i/a?d  dis- 
tance from  the-  global  maximuni. 


2 3 


t:r;il  1 in  Ran  1,  then,  GARS  was  allowed  to  make  170  iterations 
and  had  two  successes.  The  best  poirit,  its  percentage  of  the 

value  and  it.s  distance  from  the  global  maximum,  are  as 
giiant  for  Pait  5.  The  improvemc'nt  of  the  GARS  highe.st  function 
vaiue  over  the  PDF  iiighest  function  value  i.s  0.0009. 

^ si lUnsults  and  Cone  1 us i on.s 

Tables  ,\-l  through  A-11  list  results  by  run,  Figuro.s  1 through  3 
resent  the  mo.st  significant  ovoral  1 results  of  the  investigation. 

Figure  J shows  the  percent  of  ma.ximum  function  value  achieved  by 
i'lR-'  a.s  a i'uncticn  of  the  estimated  tmmber  of  trials  per  dimension. 
ThC'  c.stimatcd  number  of  trials  for  a given  e.xperiment  is  the 
in  'tber  o!  trial5i  the  systen:  can  be  expected  to  m;:ke  befoi'e  running 
(.lilt  of  rc'sou  >'ces . It  i.s  computed  by  dividing  the  amount  of  initial 
systen.  recource  by  the  e.stimated  average  re.sourcc  con.sui.iption  per 
trial.  In  this  problem,  the  values  of  the  performance  function 
ranged  from  -1  to  i-l,  and  the  estimated  average  re.source  consump- 
i ion  is  one  unit  per  trial.  Then  in  the  case  where  the  initial 
•sysK.-m  resource  is  400,  the  estimated  number  of  trials  is  400; 
il  the  d in.ens  i raial  i y is  2,  tin?  estimated  number  of  trials  per 
dii'nen.sion  is  200. 

Figure  1 also  illu.strates  the  rate  at  which  the  global  maximum 
IS  readied  with  re.spect  to  tlu-  number  of  trials  per  dimension 
a s(?arch  is  allowed  to  make.  It  can  be  .seen  that  given  enough 
lu  sources  to  take  ap.uroximately  200  tample.s  per  dimension,  PDF 
can  coinorge  to  the  global  maximum.  This  is  not  practical  for 
problems  of  high  dimensionality;  GARS  or  a similar  technique 
is  needed  to  .supji  1 emen t PDF  fo*  thc.se  higher  dimensional  surfaces. 


T'erccnl  .,'J  Max  irriiini  Ach  i ('Vf'd  by  I'>DF 


Figuves  2 ;md  2 indicHlr-  t inl'lueiu-e  of  PDF  <-)!i  CiAHS.— ' Figui'n  2 
i 1 1 usi  rafty  t lie  nainher  ol  sueees.-^f  ill  irove.s  di  na-n  s i un  (.sLai't- 

i !ig  1 j M!".  i he  F!n'  hesl  iieinl  ) that  GAHS  i-eijuires  In  a'diii’ve  lIv 
i;:i\i!!iuin  lunelioi:  \ a I u<  , ay  a tuiieljei,  of  the  e.stiiuaied  number  of 
trials  per  dimension  in  PDFl.  More  time  spent  in  PDF  imprinc's 
the  pei'lormanco  surface  model  and  thus  enables  GARS  to  operate 
nil, re  effieier.t  ly.  (jAHS  wc-uld  he  evi.'ii  more  ellectite  if  ai.“re 
modified  ti'  take  fuller  a.d\.antage  oi  t h<-  PDF  model  . 

Figure  3 shows  clearly  that  GAHS  in  combination  with  PDF  is  more 

powerful  than  GARS  alone.  For  each  ol  the  five  performance 

surfaces,  the  number  of  succe.ssful  moves  per  dimen.sion  for  GARS 

to  achiei-e  the  maximum  is  given  as  a function  of  dimensionality 

* — 
for  each  of  three  siai'ting  points;  x , the  PDF  best  point;  x, 

the  cluster  center  <•.  t the  best  periormance  class;  and  Xj^ , the 
cluster  center  from.  t,h.,>  tow  pcrior:n:>n-‘o  cl-'ss  used  as  an  inde- 
pendent stai'liiig  point.  In  the  case.s  wheie  thei'O  v as  more  than 
one  cluster  in  the  top  perfoi'mance  cla.ss,  the  number  of  mo\'es 
plotted  is  the  average  number  of  moves  for  all  runs  whic!'.  ro-oched 
tiie  maximum  (see  Tahlo  h).  Except  foi'  one  case  (N  - 15  lor- 
X - Xjj).  GARS  reaolu-d  the  miaxiniuin  more  quickly  1 rum.  x than  i laim 
— * 

Xj^ , ana  1 T'oni  x . ('Un:  niinir.ui:;  at  N = 10  probably 

a f uio't  i jn  of  the-  pai't  tctil  a*-  test  surface  used.)  In  the  i ase  of 
X --  Xpj  for  N-15,  the  GARS  random  search  quickly  found  a point 

with  a function  value  almost  as  large  as  tho.st'  cases  using  the 

♦ 

PDr-gf'U'>rat  ed  v a.nfl  x as  stalling  points.  This  point  v.a.s  loi.ateii 
in  a more  favc.rablo  po.silion  on  t h('  surfac'  so  that  GARS  spent 
less  time  on  the  biased  searchus  and  the  gradient  search.  It 
i.s  readily  appaifiit  thai  PDF  is  a valuaiile  ftrcl  imi  nn.^'y  step  to 
GARS,  and  can  nrufrtably  bo  substituted  tor  the  rand,  rm  search 
phase  oi  GAHh. 

—'In  Figure  2,  t lie  value;  for  lire  i en -d  i men.s  i on  al  problem  has  been 
set  off  .seiiaraiclv  a.r.ri  not  Itei  n i ncl  ud<.'d  in  i he  i nt  e;rpol  at  i ons  . 

The  ten-dimensional  searrit  di.scovered  its  best  point  in  the 
lirst,  smalle.st  (100-ooint)  random  sampling  in  PDF.  It  did  not 
imiprove  on  thi.s  \alui^  in  a sub.sequent  200-point  random  ocarcli 
or  Ml  tlie  guided  sea rclu.s  loll<nving  the  random  searches.  Thus 
the  PP/F  results,  the  GAHS  stariiiig  unir.ts.  anil  the  GARS  porlonr.- 
ancf-  aro  not  comtiarab  lu  to  thusi-  lor  i lu  ■ iitlier  |.ii  ubieiii.s . 
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Numbt^r  of  GARS  Moves  per  Dimension 
Required  to  Achieve  Maximum 
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'(  PDF  enables  GARS  to  find  not  only  the  global  maximum,  but 

other  maxima  in  the  high  per f orina.nce  class.  The  ability  to 
locate  local  maxima  can  be  significant  for  certain  proble;,is. 

For  instance,  it  may  be  desirable  that  a system  be  able  to  operate  ] 

in  more  than  one  region  in  its  parameter  space.  In  other  cases,  I 

operating  at  or  near  the  global  maximum  may  be  unfeasible  if 

this  value  is  close  to  a catastrophic  operating  region.  (See  j 

Run  24,  Table  A-6  and  Run  30,  Table  A-8.)  \ 

I 

i 
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Sl-XTION  IV 

ASSESSMENT  OF  TIIE  COMl’LKXITY  OF  A SEARCH  (OPTIMIZATION)  PROBLEM 


4.1  Need  for  a Measure  of  Complexity 

A major  problem  is  to  estimate  the  "complexity"  of  a search  problem, 
(’omplexity  of  the  performance  hypersurface  can  be  defined  to  be 
(1)  the  number  of  modes  (peaks),  (2)  their  locations  relative  to 
each  other.  (3)  their  shape  and  volume,  and  (4)  the  estimated 
maximum  performance  value  within  each.  If  this  information  were 
obtainable  belore  beginning  a search  problem,  these  data  would  not 
only  specify  the  complexity  of  the  search  problem,  but  would  also 
probably  identify  the  most  appropriate?  searcli  strategy. 

4.2  Applicability  of  Cluster  Analysis 

It  was  shown  in  the  previous  effort  (11,  12,  13)  that  a clustering 
algorithm  can  be  useful  in  pointing  out  regions  of  a performance 
surface  that  liave  .significant  locatioiis,  volumes,  u;-  pe.?' iortnance? 
values.  It  can  locate  both  peaks  (maxima)  and  valleys  (minima), 
and  give  information  concerning  the  size,  location,  and  approximate' 
extreme  value  of  each- 

The  procedure  to  be  followed  is  very  similar  to  Parts  1 and  2 of 
PDF:  a random  search  followed  by  a clustering  analysis—^  (see 

Section  3.4).  The  performance  space  should  be  sampled  extensively 
enough  to  ensure  that  no  large  regions  arc-  neglected;  that  is, 
the  space  must  be  sampled  fairly  evenly,  with  no  large  unsampled 
gaps,  to  minimize  the  possibility  of  missing  a potentially  signi- 
ficant extremum. 

Following  the  random  .search,  the  .sample  points  arc  divided  into 
periormance  classes  according  to  their  associated  function  values. 
The  classes  .need  not  have  equal  function  value  ranges  (i.e.,  the 


The  Mucciardi -Gose  CLUSTR  algorithm  was  used  (10). 
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dtlierence  butweun  upper  and  lower  bounds).  \>r  contain  equal 
numbers  of  sample  points.  The  nvimbor  of  classes  is  limited  only 
by  the  number  of  sample  points;  that  is,  there  should  be  a 
sutticient  number  of  points  in  each  class  to  make  a clustering 
analysis  useful. 

Each  performance  class  is  clustered  separately.  The  following 
information  is  determined:  number  of  cells  in  each  cla;.;s, 

location  (i  .e . center  or  mean)  of  each  cell,  size  of  each  cell, 
number  of  points  in  each  cell,  and  a list  of  the  identity  of 
each  point  in  each  cell.  In  addition,  an  optional  "intei'cell 
analysis"  may  be  performed.  This  analysis  locates  and  identifies 
pairs  o'  ceils  which  ov'erlap,  whetl’.er  they  are  in  the  same  class 
oi  in  different  classes.  For  a given  pair,  the  intercell  analysis 
also  estimates  the  percentage  of  each  cell’s  volume  falling  within 
th.eir  common  region. 

4.3  Illustrative  Kxtmiplc 

A sample  problem  is  the  clearest  way  of  illustrating  the  utility 
of  clustering  analyses  as  a means  of  assessing  complexity. 

The  two-dimensional  performance  surface  employed  in  this  study 

was  di^:.cribed  in  Tables  1 tlirough  4.  Tlie  locations,  shapes, 

and  sizes  of  the  five  performance  modes  are  illustrated  in  Figure  4. 

The  small  ellipse  for  each  mode  is  located  ar  one  "size  factor" 
distance  from  the  center  of  the  mode--the  large  ellipse  at  tv  ice 
that  distance.  Tlie  performance  surface  consists  of  three  modes  of 
positive  height  (3,  4,  and  5)  at  the  left  of  tho  surface,  and  two 
modes  of  negative  height  (1  and  2)  at  the  right  of  the  surface; 
that  is,  three  peaks  and  two  valle>s.  There  is  a moderate  amount 
of  influence  between  Modes  1 and  4,  and  2 and  3;  strong  influence 
between  .Modes  3 and  5,  and  4 and  5;  and  very  little  influence 
between  any  other  pairs. 
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For  this  problem,  the 

four  perf orm.ance 

classes  are  divided 

as 

.•'n  1 lows ; 

Class  1 

0.5  < P 

Class  2 

0.0  < P < 

0.5 

Class  3 

-0.5  c P < 

0.0 

C 1 as  s 4 

P < 

1 

O 

Given  the  amplitudes 

of  the  five 

modes 

(as  listed  in  Table 

3) 

we  should  expect  the  clustering  analysis  to  construct  one  or  more 
clusters,  close  together,  in  each  of  the  two  extreme  classes  and 
several  distributed  over  a wider  area  in  each  of  the  two  middle 
classes.  A middie-class  cluster  represents  one  of  two  possi- 
bilities; a performance  mode  with  its  e.xtreme  value  in  the  perform 
ance  class  in  question,  or  a region  of  transition  between  a 
higher  class  and  a lower  class.  The  intercell  analysis  described 
in  Section  4.2  above  is  useful  in  determining  what  a given  cell 
represents.  In  the  first  case--a  mode  with  its  extreme  value  in 
the  class  in  question--the  cell  will  overlap  primarilv  with  cells 
of  the  next  Ic-wer  class  if  it  contains  a peak,  or  of  the  next 
higher  class  if  it  contains  a valley.  In  ti»?  second  cast; — a transi 
tional  region  between  a higner  and  a lower  class — the  cell  will 
overlap  with  cells  from  both  classes,  and  possibly  with  other  cells 
from  its  own  class. 

The  cluster  analysis  was  performed  first  on  50  points  taken  randomlv 
from  the  suiface  as  sliown . It  produced  12  clusters,  distributed  as 
follows : 


Cl  ass 

1 

- 

One  cluster 

Class 

2 

- 

Fiv^e  clu.sters 

Cl  ass 

3 

- 

Five  clusters 

Class 

4 

_ 

One  cluster 
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The  locutions,  shapes  and  sizes  oi  tlio  clusters  ai'e  illustrated 
in  Fi;;ure  5.  Notice  that  the  surface  has  been  incompletely 
sampled;  some  of  the  outer  portions  of  the  surfat-e  are  not  included 
in  any  of  the  clusters. 

However,  even  given  the  skelcliinoss  of  the  sample,  the  cluster 
analysis  gives  very  significant  result.^.  Cell  1,  the  single 
cluster  ii'.  Class  1,  contains  the-  global  m;ixinn..'Vi ; li-.  its  po.sjti<5n 
and  size  it  also  indicates  tliat  the  mutual  itn'Juencu  of  Mr.dc'S  4 
atui  5 has  broadened  the  area  of  pei  lormanco  o.  Cla.*=:.s  1 . Cells  2 
througii  4 describe  the  region  resulting  : rttr.:  Mode  4 and  tne 
waning  influence  of  Mode  5.  Colls  5 and  6 are  limited  vo  one 
point  each  due  to  the  limited  sttmpiing  in  that  region,  but 
they  indicate  the  Class  2 region  resulting  from  Mode  h and  sortie 
.slight  influence  of  Mode  b . 

fells  7 and  8 cover  so.me  of  tiie  tran.sii  tonal  art;;,  berwe'on  the 
posiiivt.-  Modes  3 through  5,  aitd  the  negative  .Modes  1 and  2. 

Cell  9 includes  not  only  a good  deal  of  transitional  area,  but 
also  most  of  the  region  dominated  by  Mode  2.  Cell  lu  cunltiins 
only  one  point,  but  that  point,  .since  it  is  located  on  the 
outer  boundary  of  Cell  12  (the  only  cell  in  Class  4),  helps 
tC)  define  the  limits  ol  Class  4.  Cell  11  ais(.>  contains  a single 
point:  ii  helps  to  determine  i,hr  boundary  beiween  Cla.ss  2 and 

Cl as.s  3 

Ceil  12  contains  th('  center  of  Mode  1;  its  .shiit  downward  is  due- 
to  tire  influe"ce  ol  Modes  2 (ne'gative)  and  4 (po.sii  i\'e).  It  is 
noticeai:iy  larger  than  Cel!  1 .s..n(u  Mode  1 ;s  larger  tlian  Mode  b. 
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Class  i (Cluster  1) 
Class  ?.  (Clusters  2-6) 
Class  3 (Clusters  7-li) 
Class  i (Cluster  12) 
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In  these  examples,  the  nodes  in  the  middle  perfornuince  classes 
t,Classes  2 and  3)  were  not  isolated.  To  find  them  it  would  only 
be  necessary  to  increase  the  random  sampling  and  to  make  the 
performance  class  values  narrower.  Eventually,  each  peak  would  be 
repre.sented  by  a cell  containing  the  local  maximum,  .surrounded  by 
rings  of  ceils  from  decreasing  performance  classes.  Similarly, 
each  valley  would  produce  a cell  containing  the  local  minimum, 
surrounded  by  rings  of  cells  of  increasing  performance  classes. 

The  function  value  of  the  cell  centers  in  Classes  1 and  4 would 
givf'  fair  approximation;  of  the  global  maximum  and  minimum. 

A second  cluster  analysis  was  performed,  this  time  on  a 100-point 
random  sample  of  the  same  surface.  The  results  are  shown  in 
Figure  6.  Due  to  the  increased  density  of  the  search,  the  model 
defines  the  boundaries  of  the  four  classes  more  precisely.  It 
is  easier  to  perceive  the  transition  trom  Class  i through  ('la-sse;;  2 
and  3 to  Class  4.  In  addition,  it  is  clearer  that  the  cell  in 
each  extreme  class  is  actually  surrounded  by  a ring  of  cells  in 
the  next  lower  class  (the  Class  1 cell  surrounded  by  Class  2 cells) 
cr  the  next  higher  class  (the  Class  4 cell  surrounded  by  Class  3 
cells). 

In  tests  run  on  problem.s  of  higher  dimon.sionality , it  becomes  some- 
what more  difficult  to  intei-pret  the  results.  However,  it  is  evi- 
dent that  cluster  analyse.s,  when  properly  interpreted,  are  able 
to  provide  excellent  information  con(;erning  the  comple.xity  of  high- 
diinonsional  surfaces.  This  can  be  .seen  from  the  fact  that  a cluster 
ing  analysis  of  any  random  .sampling  of  one  of  the  surfaces  i ployed 
in  this  project  always  yielded  at  least  one  cell  which  contained 
the  global  maxlmu.m.  In  addition,  the  number  of  cells  in  any  class 
was  equal  to  (and  in  most  cases  greater  than)  the  number  of  per- 
formance modes  in  that  class,  with  two  exceptions,  as  shown  in 
Table  11. 
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Class  1 
Class  2 
Class  3 
Cl  ass  ‘1 


l lf.nu-:  G : CI.L'STKKINV,  AXAI.YSIS  OF  THF  TWO-DIMENSIONAL 

I’EHIOH.MAKC’-;  TEST  FUNCTION 
(THOM  A ir.O-POINT  SAMPLE) 

MS 


TABLE  11 


COMPLEXITY  OF  PERFORMANCE  SURFACE 


ESTIMATED  BY 

CLUSTER 

ANALYSIS 

KTOT-^ 

Modes 

In  Class 

Cells 

In  Class 

N 

_1 

2 

_3 

_4 

1 

_2 

_3 

_4 

2 

50 

1 

2 

1 

1 

1 

5 

5 

] 

2 

100 

1 

o 

1 

1 

1 

9 

12 

1 

2 

200 

1 

2 

1 

1 

4 

15 

24 

5 

5 

100 

2 

1 

0 

2 

2 

11 

7 

3 

5 

200 

2 

1 

0 

2 

8 

24 

12 

5 

10 

100 

3 

0 

0 

2 

1 

4 

3 

6 

10 

200 

2 

{) 

0 

2 

2 

3 

6 

12 

15 

200 

3 

0 

0 

2 

3 

13 

6 

9 

20 

200 

3 

0 

0 

2 

3 

15 

6 

8 

^ j 

Number  of  Points  in  Random  Sample. 


39 


The  two  exceptions  are  for  Class  1 in  the  two  10-dimensional 
searches.  In  the  10-dimens lonal  problem,  the  three  high  perform- 
ance modes  are  usually  close  in  dimensions  6,  9 and  10;  Modes  3 
and  4 are  close  in  dimension  7,  and  4 a.nd  5 in  8.  (See  Table  1). 
Their  strong  mutual  influence  results  in  a large  region  of  high 
performance,  which  is  interpreted  by  the  clustering  algorithm, 
as  a single  cell  (.in  the  first  case)  or  as  two  overlapping  cells 
( in  the  second ) . 

4.4  Conclusions 

It  can  be  seen  that  clustering  analysis  fulfills  the  needs  of  a 
complexity  assessment:  it  can  discover  peaks  and  valleys, 

report  their  locations,  estimiate  their  sizes  and  volumes,  provide 
information  for  search  initiation,  and  approximate  function  values 
In  addition,  it  can  locate  and  characterize  transitional  regions. 
Therefore,  the  clustering  algorithm  does  provide  readily  accessibl 
information  about  the  structure  of  a given  performance  space. 
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SECTION  V 


EXAMINATION  OF  AN  IMAGE-PROCESSING  PRObLKM 
WiTiI  POTENTIAL  RPV  APPLICATION 

One  persistent  problem  in  both  communication  and  weapons  sysi:e:;:s 
development  has  been  the  efficient  encoding  and  transmittal  of 
digitized  images.  The  technique  most  widely  used  at  pre.^i-ni  is 
as  follows;  Each  line  of  a digitized  image  is  treated  as  ;i  wa\f'- 
form,  the  waveform  is  encoded  by  performing  a Fourier  transfoi 
The  resulting  set  of  Fourier  coefficients  for  that  line  is  thc-n 
transmi tted , and  the  picture  is  reconstructed  line-ny-line  via  nn 
inverse  Fourier  transform. 

The  set  of  Fourier  coefficients  contains  as  niany  elem.ents  as  does 
the  line  of  data  itself  (that  is,  the  Fourier  transform  of  a 
100-element  line  has  100  coefficients).  It  has  been  found  that 
it  is  possible  tc  discard  some  ot  the  coefficients  associated  with 
the  highest  frequencies  and  transmit  the  remaining  fraction 
(1,  4,  5,  6,  7).  A problem  arises  in  deciding  how  rnmny  coeffi- 
cients to  retain.  Of  course,  fewer  coefficients  retained  and 
transmitted  implies  faster  and  easier  tran.smission  and  recon- 
struction. The  penalty  lies  in  poorer  image  resolution. 

Since  the  area  of  fast  Fourier  transformi  (FFT)  retention  and  rec.on 
struction  has  been  thoroughly  explored,  it  seemed  beneficial  to 
approach  the  basic  problem.  — high  accuracy  of  reconstruction  with 
minimal  data  ti'arismission  — in  an  encirely  new  manner. 

Instead  of  viewing  the  digiti.^'ed  imago  a 1 ine-at-a-time , the 
iiTiage  can  be  considered  as  a matrix  with  each  point  fele.ment) 
pcjssessing  three  de.scr Iptors ; row.  coium.n,  and  gray  level  infor- 
mation (e.g.,  reflectivity,  visual  density,  etc.).  Approaching 
it  in  this  way  enables  one  to  visualize  the  image  as  a three- 
dimensional  per fortTiance  surface.  That  is,  gray  level  informat  i('n 
(y)  can  be  regarded  as  a function  of  location  (row,  x^.  and 
column,  X2^- 
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Tnis  approacli  ha.s  three  main  advantages  over  the  row-by-row  approach. 
First,  the  eye  does  net  perceive  an  ima^e  in  horizontal  bands, 
but  as  a whole;  an  encoding  technique  that  does  the  same  can  poten- 
tially achieve  improved  subjective  information  content.  Second, 
by  treating  a continuous  area,  an  algorithm  will  be  more  sensitive 
to  interesting  features  (e.g.,  large  patches  of  one  gray  level,  or 
a repetitive  pattern)  than  a row-by-row  analysis  can  be.  Third, 
identification  of  regions  (i.e.,  "clusters")  of  a given  gray  level 
in  the  image  also  provide  the  first  stage  in  recognizing  classes 
of  objects,  (21  "targets"  in  the  camera's  visual  field. 

The  problem  of  image  encoding  resembles  more  closely  the  problem 
of  assessing  complexity  (Section  4 --  locating  and  describing  ..11 
extrema)  than  it  does  the  problem  of  optimization  (Section  3 — 
locating  a single  extremum).  Therefore,  clustering  analysis 
appeared  zu  be  quite  useful. 

5.1  Description  of  Problem 

The  problem  considered  in.  this  portion  of  the  project  wa.s  the  en- 
coding and  reconxtiuction  of  a photograph  of  downtown  St.  Louis, 
shown  in  Figure  7.  The  picture  contains  very  light  areas  — the 
sunlight  reflecting  from  the  metal  arch  — and  very  dark  areas  — 
.shadows  of  buildings.  It  is  a rather  detailed  and  complex  picture, 
particularly  because  areas  of  corruTion  gray  level  are  not  always 
contiguous . 

The  figure  was  digitized  by  division  into  32  rows  and  32  columns, 
or  1,024  separate  locations.  Two  hundred  and  f.ifty-six  levels  of 

D 

gray  were  used  for  each  of  the  1,024  locations.  The  2 gray  levels 
were  condensed  into  five  bands  from  1 (black)  to  5 (white).  The 
bands  were  approximately  logarithmically  chosen  as  I'ecornmended  in 
reference  ( 8 ) : 


Gray  Level  Range 


0 - 34 

35  - 79 

80  - 138 

139  - 215 

216  - 255 


Reduced  Range 

1 (black) 

2 

3 

4 

.5  (white) 
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A s,t;heiTU=  lor  printing  out  the  digitized  picture  by  cornputei  i>.as 
devised  e.s  recorr.nu'nded  in  reference  (8).  The  computer  ree<in- 
structior  is  shown  in  Figure  8. 

The  digitized  image  was  encoded  and  reconstructed  in  twc>  ways. 

First  , each  class  was  clustered  Individually  and  the  picture  was 
reconstructed  by  assig.  ing  each  point  in  the  32  x 32  location 
matrix  to  the  cell  nearest:  to  it.  Second,  and  independently,  the 
picture  was  subjected  to  a low-by-row  Foui ier  transform  and  recon- 
structed several  tirne.s,  varying  the  number  of  coefficients  retained. 

5.2  Clustering  Results 

A clustering  analysis  was  performed  separately  on  each  per fornuince 
class  in  the  following  manner. 

The  following  number  of  cells  was  generated  fcr  each  class; 


Class 

1 

( black) 

- 

17 

Class 

2 

- 

21 

Class 

3 

- 

17 

Class 

4 

- 

99 

Class 

5 

( wh i t e ) 

- 

9 

163 

Figures  9 through  13  show  the  locations  of  th.e  various  cells  in 
each  performance  class. 

The  picture  was  reconstructed  fio.m  the  163  clusters  in  the  follow- 
ing arbitrary  way;  Each  point  was  e.xafninod  separate-ily ; the 
nearest  cell  was  found  and  the  point  was  assigned  to  the  perforrranco 
class  associated  with  that  cell. 
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FIGURE  9:  CLUSTER  STRUCTURE  OF  GRAYNESS  CLASS  NO. 


FIGUHL  10;  Cl.l.-STER  STRUCTnu:  OF  GRAYNFSS  CLASS  NO.  2 
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The  1,02'!  points  in  the  picture  matrix  were  separated  into  the 
live  performance  classes.  The  table  below  shows  the  number  of 
points  per  class,  and  the  perceiitatte  of  the  total  number  of  points 
in  each  class. 


Class  Number 

Number  of  Points 

Percentage  of  Total  Points 

1 

74 

7.23 

2 

82 

8.01 

3 

77 

7.52 

4 

744 

72.66 

5 

47 

4.59 

] .024 

100. 

By  far  the  greatest  number  of  points  is  in  Clas,  4,  which  is  the 
gray  level  that  includes  all  of  the  sky  (.see  Figure  7). 

Initially,  the  matrix  information  was  input  to  the  cluster 
analysis  row-by-row,  from  top  left  to  bottom  right.  However,  for 
subsequent  clustering  analyses,  the  points  within  each  of  the 
five  classes  were  randomly  presenred.  This  enabled  the  clustering 
algorithm  to  exploit  its  ability  to  locate  regions  of  interest. 
Additionally,  avoiding  a row-by-row  analysis  emphasized  the  con- 
trast t'etween  encoding  by  clustering  and  encoding  by  Fourier  trans- 
forms. The  reconstructed  i»icture  for  the  clu.ster  analysis  is 
shown  in  Figure  14. 

5 . 3 Foui'ier  Ti  ans  form  Resul  ts 

For  the  Fourier  anaiy.ses,  the  picture  was  transformed  and  encoded 
row-by-row,  resulting  in  32  sets  of  coefficients,  each  set  contain- 
ing 32  coefficients.  The  picture  was  reconstructed  in  five  ways, 
each  using  the  following  number  of  coefficients; 
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FIGURE  14:  RECONSTRUCTION’  OF  PICTURE  FROM  CLUSTER  STRUCTURE 
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In  a'l  cases,  the  highest  frequency  coefficients  were  eliminated; 
this  means  that  the  data  were  low-pass  (spatial)  filtered.  Since 
the  first  two  reconstructions  are  very  similar  to  the  original 
transform  both  in  number  of  coefficients  and  ir.  reconstruction 
accuracy,  only  the  last  three  were  considered  extensively.  Their 
reconstructions  are  shown  in  Figures  15  through  17. 


i 

5.4  Comparison  of  Resul ts  1 


The  subjective  accuracy  of  the  cluster  reconstructed  pictures 
is  less  than  that  ol  any  ol  the  Fourier  reconstructions.  However, 
this  is  probably  attributable  to  the  coarseness  of  the  digitiza- 
tion and  the  resulting  large  size  of  the  clusters  --  particularly 
those  in  Class  4 (as  shown  in  Figure  12). 

Various  objective  measure.s  of  accuracy  enable  more  quantitative 
comparisons  to  be  made.  To  begin  with,  it  is  important  to  consider 
the  reduction  in  transmitted  information  achieved  by  each  of  the 
algorithms.  The  amount  of  data  initially  available  is  1,024 
■performance  values"  (i.e.,  gray  levels). 

The  amount  of  information  transmitted  after  clustering  is  equal  to 
two  scalais  for  each  one-point  cell  (location  coordinates)  and 
lour  scalars  for  each  larger  cell  (two  location  coordinates  and 
two  size  factors).  The  a.mount  of  information  transmitted  by  the 
Fourier  transform  method  is  the  product  of  the  numbei’  of  rows 
and  the  number  of  coelficjonts  rt'iained  per  row.  Table  12  lists 
tne  reduction  in  information  volume  for  the  cluster  analysis  and 
the  last  three  Fourier  analyses.  There  are  two  waj's  of  describing  the 
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riGL'RE  15:  RKCONSTHUCT ION  OF  PICTURE  FROM  FOURIER  TRANSFORM 

(7  LOW  FREQUENCY  COEFFICIENTS  RETAINED) 
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TABLE  12 


REDUCTION  AND  COMPRESSION  OF  INFORMATION 
BY  VARIOUS  ENCODING  METHODS 


Method  of  Encoding 
Infoi’niat  ion 

Number  of  Data 
Cells  ( N ) 

Reduction 

I’actor 

(l-N/1024) 

Compress  ion 
Factor 
( 1024/N) 

Original  Picture 

1 ,024 

0.00 

1 . 00 

Cluster 

604 

0.41 

1 .70 

Fourier  3 

736 

0.28 

1 .39 

Fourier  4 

480 

0.53 

2.13 

Fourier  o 

224 

0.78 

4 . 57 
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reduction  ot  in  t or»’'i  t ion  ; one  is  the  rt>duc  t i or,  f i^.clor  , ti 


frnction  of  original  information  that  has  been  eliminated 

R - 1 _ -l!  - 
1024 

where  N is  the  number  oi  scalars  transmitted  lor  a givcui  recon- 
struction; the  -second  is  the  conspress  ion  f ac  titr , the  ratio  ol 
original  in  forma  t ioi;  to  transmitted  in  foniiat  ion 

^ 1 02-1 
e - . 

The  cluster  analysis  ccnipures  favorably  to  the  Fourier  analyses; 
its  reduction  in  inforciation  falls  between  the  tliird  and  fourth 
Fourier  transforms. 

Another  way  cf  assessing  rec.onstruct ions  is  by  comparing  their 
relative  accuracies  point-by -point  for  the  1,024  points.  Table  13 
shows  'confusion  matrices"  and  resultant  percentage  accuracy  for 
each  of  the  lour  reconstructions.  Again,  the  cluster  reconstruc- 
tion compares  fa\’ocably;  its  accuracy  is  only  slightly  less  than 
that  of  the  fourth  Fourier  transform. 

Figures  18  and  19  summarize  the  results  of  the  two  tables.  The 
accuracy  of  the  cluster  results  fall.s  about  7 percent  below  that 
which  would  be  expected  from  a (hypothetical)  Fourier  eticoding  with 
the  same  reduciiou  and  compression  lac  tors. 

These  results  are  very  encouraging,  particularly  in  view  of  two 
considerations:  First,  the  iiaiited  range  cl  the  performance 

classes  is  more  favorable  to  the  Fourier  method  than  to  clustering. 
The  clustering  algorithm  can  readily  deal  with  almost  any  number 
of  gray  lev(;ls;  but  given  a wide  range  of  values  to  model,  the 
smoothing  effect  ol'  a truncated  Fourier  transform  would  tend  to 
elitfiinate  the  extremie  values,  which  might  be  the  najst  informative. 
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TAIU.K  i;< 

CONFUSION  MATRICKS  OF  VARlOIi  IMACiK  RKCON'SJHUCl  1 ONS 
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FIGlRi;  19:  IMAG1-;  KKCONSTHl.’CT  I OX  ACC'L'KACY  AS  A FUNCTION  OF 

INFORMATION  COMPHFSS 1 ON  I^OH  TWO  HKCONSTRUCTION 
PHOCFm'HhS 
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SfM-c^iid  , 1 lu'  I (.H-onslriu;  t i on  1 rom  ilnst.<-rs  used  tuM’o  uas  ;in  :irbi~ 

travy  and  very  sMnpl<,-  nu-lhod.  Thc,>re  are  several  othei'  rm'thod;-: 
to  ehoose  1 rom ; for  oNamplc'.  eompntinu  thv'  per  f Orniane  e ol  ea'di 
pivin;  as  a weifj;htt'd  s'jm  of  the  values  of  the  nearest  flusters 
or  oven  -if  all  t hi?  clusters.  A more  .sophi  s t fixated  reconstruction 
t)i  ocedare  could  inc.i  easo  1 In  a<.  curav  y of  re ei>nst  ruei  1 or.  from 
clu.sters  well  above  that  fj’om  a Fourier  tran.sform  with  the  same 
volume  of  information. 

5.5  Conclusions 

Clu.stcring  analysis  is  certainly  worth  consideration  as  a method 
of  image  encoding  and  r eeonstru<;, t i on  . On  ,a  problem  ir.volving  a 
very  coar.st?  sample  size  (32x321,  reconstruction  from  dusters 
using  a sirriple  techniciue  was  only  slightly  less  accurate  than 
reconstruct  ion  via  a Fourier  transform  with  roughly  t he  oame 
reduction  of  iniormation  volume.  in  nddij.^n  to  ptovidlng  a 
good  reconstruction,  clustering  analysis  can  find  regions  of 

liossible  interest  within  tJhe iii‘<t_ge  because  of  its  ability  to  c.on- 

sLder  the  image  as  a whole  rather  than  row-by-row.  Thi.s 

charac  t or  i ,s  t ic  considerably  enli.ancos  its  value  as  a tool  in  image 

pattern  recognition  and  c 1 a.'^si  f i ca  t i on  . 
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C0^!CLL■3I0^’^,  A?,'D  RKCOMMENDATIONS 


Tho  work  effort  in  this  study  has  been  devoted  to  extension  and 
further  development  of  search  altrorit  hms  of  utility 
for  self-organizing  control  systt;ms  relevant  to  Air  Force  ne<>d;j. 
Applications  of  these  algorithms  have  demonstrated  their  capabilities 
for  use  in  optimization  of  the  parameters  of  human  far.tors  nod-.;ls 
that  desci'ibe  chai'.acterist  ics  i f man-machine  interface  proolems. 


The  results  of  this  study  can  he  sunmarizod  as  follows 

• Search  methods  developed  in  the  previous  study  have 
been  extended  to  h i gher- dimensiona 1 , mul t imoda 1 
problems  and  have  been  sliown  to  be  very  effective'. 

o A comno.site  search  algorithm  incorporating  both  the 
pdf-giiidcd  search  and  the  guided  accelerated  random 
search  has  been  simulated  and  found  to  be  more  effec- 
ti'.c  than  either  search  algorithm  alone. 

« Clustering  analysis  for  asse.ssing  the  complexity  of 
a search  surface  has  shown  to  be  of  value. 

# A lu'w  method  for  ’.mage  encoding  has  boon  formulated 
that  appears  to  be  potentially  superior  to  methods 
currently  in  use. 

Further  work  should  be  initiated  to  .'r-eek  ways  of  making  current 
techniques  more  powerful,  and  to  broaden  their  areas  of  application. 
\V(?  recommend  that  the  following  areas  oe  investigated: 

*»  Extensive  effort  should  be  devoted  to  developing  an 
algorithm  that  combines  the  best  features  of  PDF  and 
GARS.  The  substitution  of  PDF  for  the  first  portion  of 
GARS  has  been  demonstrated  to  be  ar  effective  strategy. 

The  next  step  should  be  to  combine  them  into  a single 
algorithm  and  to  insert  PDF  into  the  statistically 
biased  search  phases  of  GARS  so  that  its  multimodal  pdf 
model,  which  extends  throughout  the  performance  space, 
can  be  used  to  guide  GARS  in  the  choice  of  trial  points, 
rather  than  the  present  unimodal  pdf  model.  Additionally, 
provision  should  be  made  to  explore  local  maxima  as  wi.-ll 
as  the  global  maximum,  should  this  be  desirable. 


• The  use  of  clustering  analysis  to  measure  the  complexity 
of  a search  (optimization)  problem  should  be  examined 
more  '■.losely.  A technique  is  needed  that  will  make  the 
cluster  results  nion.'  readily  understood  as  a measure 

of  con.plexity . This  is  especially  important  in  problems 
of  high  dimensionality,  where  interpretation  of  clustering 
results  can  be  difficult. 

• The  use  of  clustering  analyses  to  encode  and  reconstruct 
images  should  be  developed  using  pictures  with  finer 
divisions.  A reconstruction  function  should  be  formulated 
that  will  take  more  advantage  of  the  benefits  to  be 
gained  from  clustering  --  in  particular  its  sensitivity 

to  regions  of  interest  in  the  picture.  The  lattei  ability 
will  directly  couple  this  new  encoding  technique  to 
pattern  recognition/classif loatlon  interests. 
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